KMID : 1022420110030010087

Phonetics and Speech Sciences
2011, Vol. 3, No. 1, pp. 87-94

Two-step a priori SNR Estimation in the Log-mel Domain Considering Phase Information

Lee Yun-Kyung

Kwon Oh-Wook

Abstract

The decision-directed (DD) approach is widely used to estimate the a priori SNR from noisy speech signals. Conventional speech enhancement systems based on the DD approach estimate the a priori SNR from the magnitude components only, so the estimate follows the a posteriori SNR with a one-frame delay. We propose a phase-dependent two-step a priori SNR estimator based on the minimum mean square error (MMSE) criterion in the log-mel spectral domain. The estimator takes both magnitude and phase information into account and overcomes the performance degradation caused by the one-frame delay. Experimental results show that the proposed estimator improves the output SNR of the enhanced speech signals by 2.3 dB compared with the conventional DD-based system.
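
The one-frame delay of the conventional DD recursion can be illustrated with a minimal Python sketch for a single frequency bin. This is not the authors' log-mel, phase-dependent estimator: it is the textbook magnitude-domain DD update followed by a generic second refinement step. The function name `dd_two_step_snr`, the smoothing factor `alpha = 0.98`, the floor `xi_min`, the Wiener gain, and the choice to feed the first-step speech estimate back into the recursion are all assumptions made for illustration.

```python
def dd_two_step_snr(noisy_power, noise_power, alpha=0.98, xi_min=1e-3):
    """Sketch of DD a priori SNR estimation with a generic second step.

    noisy_power: per-frame noisy power |Y(t)|^2 for one frequency bin.
    noise_power: noise power for that bin, assumed known and constant.
    Returns (xi_dd, xi_two_step), one value per frame.
    """
    xi_dd, xi_two_step = [], []
    prev_speech_power = 0.0  # speech power estimate from the previous frame

    for y2 in noisy_power:
        gamma = y2 / noise_power  # a posteriori SNR of the current frame

        # Step 1: decision-directed estimate. With alpha close to 1 it is
        # dominated by the previous frame, hence the one-frame delay.
        xi = alpha * prev_speech_power / noise_power \
            + (1.0 - alpha) * max(gamma - 1.0, 0.0)
        xi = max(xi, xi_min)
        gain = xi / (1.0 + xi)  # Wiener gain (an assumed gain rule)

        # Step 2: re-estimate the SNR from the current frame after
        # applying the first-step gain, so it reacts without the delay.
        xi_refined = max(gain * gain * gamma, xi_min)

        xi_dd.append(xi)
        xi_two_step.append(xi_refined)

        # Assumption: the first-step speech estimate drives the recursion.
        prev_speech_power = gain * gain * y2

    return xi_dd, xi_two_step


# Toy input: three noise-only frames followed by a speech onset.
frames = [1.0, 1.0, 1.0, 10.0, 10.0, 10.0]
xi_dd, xi_two_step = dd_two_step_snr(frames, noise_power=1.0)
```

On this toy input the DD estimate at the onset frame is driven almost entirely by the `(1 - alpha)` term, while the second-step estimate is already larger in the same frame, which is the lag that a two-step scheme is meant to reduce.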

Keywords

phase modeling, speech enhancement, speech separation, MMSE, decision-directed, a priori SNR
